Learning Content Selection Rules for Generating Object Descriptions in Dialogue
نویسندگان
چکیده
A fundamental requirement of any task-oriented dialogue system is the ability to generate obje t des riptions that refer to obje ts in the task domain. The subproblem of ontent sele tion for obje t des riptions in task-oriented dialogue has been the fo us of mu h previous work and a large number of models have been proposed. In this paper, we use the annotated o onut orpus of task-oriented design dialogues to develop feature sets based on Dale and Reiter's (1995) in remental model, Brennan and Clark's (1996) on eptual pa t model, and Jordan's (2000b) intentional in uen es model, and use these feature sets in a ma hine learning experiment to automati ally learn a model of ontent sele tion for obje t des riptions. Sin e Dale and Reiter's model requires a representation of dis ourse stru ture, the orpus annotations are used to derive a representation based on Grosz and Sidner's (1986) theory of the intentional stru ture of dis ourse, as well as two very simple representations of dis ourse stru ture based purely on re en y. We then apply the rule-indu tion program ripper to train and test the ontent sele tion omponent of an obje t des ription generator on a set of 393 obje t des riptions from the orpus. To our knowledge, this is the rst reported experiment of a trainable ontent sele tion omponent for obje t des ription generation in dialogue. Three separate ontent sele tion models that are based on the three theoreti al models, all independently a hieve a ura ies signi antly above the majority lass baseline (17%) on unseen test data, with the intentional inuen es model (42.4%) performing signi antly better than either the in remental model (30.4%) or the on eptual pa t model (28.9%). But the best performing models ombine all the feature sets, a hieving a ura ies near 60%. Surprisingly, a simple re en y-based representation of dis ourse stru ture does as well as one based on intentional stru ture. To our knowledge, this is also the rst empiri al omparison of a representation of Grosz and Sidner's model of dis ourse stru ture with a simpler model for any generation task.
منابع مشابه
Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation
We tackle the sub-task of content selection as part of the broader challenge of automatically generating image descriptions. More specifically, we explore how decisions can be made to select what object instances should be mentioned in an image description, given an image and labelled bounding boxes. We propose casting the content selection problem as a learning to rank problem, where object in...
متن کاملA dialogue approach to learning object descriptions and semantic categories
Acquiring new knowledge through interactive learning mechanisms is a key ability for humanoid robots in a natural environment. Such learning mechanisms need to be performed autonomously, and through interaction with the environment or with other agents/humans. In this paper, we describe a dialogue approach and a dynamic object model for learning semantic categories, object descriptions, and new...
متن کاملGenerating Tailored, Comparative Descriptions in Spoken Dialogue
We describe an approach to presenting information in spoken dialogues that for the first time brings together multi-attribute decision models, strategic content planning, state-of-the-art dialogue management, and realization which incorporates prosodic features. The system selects the most important subset of available options to mention and the attributes that are most relevant to choosing bet...
متن کاملLearning the Use of Discourse Markers in Tutorial Dialogue for an Intelligent Tutoring System
Usage of discourse markers in tutorial language can make the difference between stilted and natural sounding dialogue. In this paper we describe some simple rules for selection of discourse markers. These rules were derived for use in an intelligent tutoring system by applying decision-tree machine learning to human tutoring language. The fact that these selection rules operate within the envir...
متن کاملCollaborative Models for Referring Expression Generation in Situated Dialogue
In situated dialogue with artificial agents (e.g., robots), although a human and an agent are co-present, the agent’s representation and the human’s representation of the shared environment are significantly mismatched. Because of this misalignment, our previous work has shown that when the agent applies traditional approaches to generate referring expressions for describing target objects with...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- J. Artif. Intell. Res.
دوره 24 شماره
صفحات -
تاریخ انتشار 2005